Perceptual segmentation and component selection in compact sinusoidal representations of audio

نویسندگان

  • Edward M. Painter
  • Andreas Spanias
چکیده

This paper presents two fundamental enhancements in a hybrid audio signal model consisting of sinusoidal, transient, and noise (STN) components. The first enhancement involves a novel application of a perceptual metric for optimal time segmentation for the analysis of transients. In particular, Moore and Glasberg’s model of partial loudness is modified for use with general signals and then integrated into a novel time segmentation scheme. The second and perhaps more significant STN enhancement is concerned with a new methodology for ranking and selection of the most perceptually relevant sinusoids.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sinusoidal Analysis-Synthesis of Audio Using Perceptual Criteria

This paper presents a new method for the selection of sinusoidal components for use in compact representations of narrowband audio. The method consists of ranking and selecting the most perceptually relevant sinusoids. The idea behind the method is to maximize the matching between the auditory excitation pattern associated with the original signal and the corresponding auditory excitation patte...

متن کامل

Tree and filter optimization for audio compression in a wavelet-based perceptual audio coder

This paper outlines a new perceptual low bit rate audio coding scheme based on adapted wavelet representations. It claims wavelet tree and filter adaptation attending to a perceptual entropy-based method. To achieve such adaptive structure, a periodized wavelet packet transform is performed for each audio frame. After the transform, the encoder employs scalar adaptive quantization, controlled b...

متن کامل

A Switched Parametric & Transform Audio Coder

In this paper, we present a system of sines+transients+noise modeling techniques that dynamically switches between parametric representations and transform coding based representations. The sines and noise are represented by parametric models using multiresolution sinusoidal modeling and Bark-band noise modeling, respectively. The transients are modeled by short regions of transform coding. In ...

متن کامل

FDMSM robust signal representation for speech mixtures and noise corrupted audio signals

The fixed dimension modified sinusoidal model (FDMSM) was recently proposed as an attractive candidate for compact representation of audio signals in adverse conditions. This paper aims to study the capability of the FDMSM signal representation for analysis and synthesis of speech mixtures as well as noisy audio signals corrupted by highly colored noise of babble and harmonic. Extensive simulat...

متن کامل

The effects of segmentation and redundancy methods on cognitive load and vocabulary learning and comprehension of English lessons in a multimedia learning environment

The present study was conducted with the aim of the effects of segmentation and redundancy methods on cognitive load and vocabulary learning and comprehension of English lessons in a multimedia learning environment.The purpose of this study is an applied research and a real experimental study. The statistical population of the present study includes all people aged 14 to 16 who are enrolled in ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001